Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 50000 |
| Missing cells | 147570 |
| Missing cells (%) | 9.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 12.2 MiB |
| Average record size in memory | 256.0 B |
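The missing-cell percentage and the memory figures follow from the counts in the table, since the number of cells is variables × observations. A quick arithmetic check, using only the figures reported above (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 32
n_observations = 50_000
missing_cells = 147_570
avg_record_bytes = 256.0

total_cells = n_variables * n_observations           # 1,600,000 cells
missing_pct = 100 * missing_cells / total_cells      # share of empty cells
total_mib = avg_record_bytes * n_observations / 2**20  # bytes -> MiB

print(f"{missing_pct:.1f}%")     # prints 9.2%
print(f"{total_mib:.1f} MiB")    # prints 12.2 MiB
```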
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 13 |
| Boolean | 4 |
| Text | 3 |
ppm has constant value "" | Constant |
amort has constant value "" | Constant |
super_conforming has constant value "" | Constant |
program has constant value "" | Constant |
relief_refi has constant value "" | Constant |
prop_val has constant value "" | Constant |
interest_only has constant value "" | Constant |
MI_cancel has constant value "" | Constant |
first_pay is highly overall correlated with mat_date | High correlation |
mat_date is highly overall correlated with first_pay and 1 other field | High correlation |
orig_cltv is highly overall correlated with orig_ltv | High correlation |
orig_ltv is highly overall correlated with orig_cltv | High correlation |
orig_loan_term is highly overall correlated with mat_date | High correlation |
channel is highly overall correlated with seller | High correlation |
seller is highly overall correlated with channel and 1 other field | High correlation |
servicer is highly overall correlated with seller | High correlation |
fha is highly imbalanced (78.1%) | Imbalance |
unit_num is highly imbalanced (94.8%) | Imbalance |
occupancy is highly imbalanced (70.5%) | Imbalance |
prop_type is highly imbalanced (54.6%) | Imbalance |
msa has 9256 (18.5%) missing values | Missing |
super_conforming has 48944 (97.9%) missing values | Missing |
prr_loan_seq_num has 44685 (89.4%) missing values | Missing |
relief_refi has 44685 (89.4%) missing values | Missing |
credit_score is highly skewed (γ1 = 75.22765869) | Skewed |
loan_id has unique values | Unique |
mortgage_ins_pct has 46511 (93.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-13 19:39:12.410741 |
|---|---|
| Analysis finished | 2023-11-13 19:39:54.530149 |
| Duration | 42.12 seconds |
| Software version | ydata-profiling v4.6.1 |
| Download configuration | config.json |
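The reported duration is the difference between the two timestamps. A small check with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-11-13 19:39:12.410741", FMT)
finished = datetime.strptime("2023-11-13 19:39:54.530149", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # prints 42.12 seconds
```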
credit_score
Real number (ℝ)
SKEWED 
| Distinct | 316 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 761.20512 |
| Minimum | 431 |
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 431 |
|---|---|
| 5-th percentile | 680 |
| Q1 | 736 |
| median | 771 |
| Q3 | 792 |
| 95-th percentile | 809 |
| Maximum | 9999 |
| Range | 9568 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 101.53447 |
|---|---|
| Coefficient of variation (CV) | 0.13338648 |
| Kurtosis | 6850.1351 |
| Mean | 761.20512 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 75.227659 |
| Sum | 38060256 |
| Variance | 10309.249 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 801 | 723 | 1.4% |
| 802 | 716 | 1.4% |
| 797 | 692 | 1.4% |
| 790 | 692 | 1.4% |
| 791 | 669 | 1.3% |
| 798 | 669 | 1.3% |
| 809 | 661 | 1.3% |
| 793 | 660 | 1.3% |
| 786 | 658 | 1.3% |
| 787 | 651 | 1.3% |
| Other values (306) | 43209 | 86.4% |
| Value | Count | Frequency (%) |
| 431 | 1 | < 0.1% |
| 443 | 1 | < 0.1% |
| 470 | 1 | < 0.1% |
| 472 | 1 | < 0.1% |
| 480 | 1 | < 0.1% |
| 486 | 1 | < 0.1% |
| 491 | 1 | < 0.1% |
| 492 | 1 | < 0.1% |
| 494 | 1 | < 0.1% |
| 497 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 5 | < 0.1% |
| 850 | 1 | < 0.1% |
| 835 | 1 | < 0.1% |
| 831 | 1 | < 0.1% |
| 829 | 3 | < 0.1% |
| 828 | 1 | < 0.1% |
| 827 | 3 | < 0.1% |
| 826 | 4 | < 0.1% |
| 825 | 5 | < 0.1% |
| 824 | 5 | < 0.1% |
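The skewness of 75.2 and the maximum of 9999 come from five rows at exactly 9999, far above the next-highest value of 850. This looks like a placeholder for an unavailable score rather than a real observation; that reading is an assumption, since the report does not define the value. A minimal sketch of masking such a sentinel before computing statistics, using a small made-up sample rather than the real column:

```python
import statistics

SENTINEL = 9999  # assumed "not available" code; not confirmed by the report

def skewness(values):
    """Population skewness (third standardized moment)."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)

# Hypothetical sample: plausible scores plus one sentinel row.
scores = [680, 736, 771, 792, 809, 801, 797, 790, 9999]

# Drop the sentinel rows instead of treating them as real scores.
cleaned = [s for s in scores if s != SENTINEL]

print(f"skew with sentinel:    {skewness(scores):.2f}")
print(f"skew without sentinel: {skewness(cleaned):.2f}")
```

The profiler may use a bias-adjusted skewness estimator, so the exact figures would differ from this population formula; the point is only how strongly a handful of sentinel rows can dominate the statistic.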
first_pay
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200923.57 |
| Minimum | 200902 |
| Maximum | 201310 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 200902 |
|---|---|
| 5-th percentile | 200903 |
| Q1 | 200906 |
| median | 200908 |
| Q3 | 200912 |
| 95-th percentile | 201002 |
| Maximum | 201310 |
| Range | 408 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 35.57835 |
|---|---|
| Coefficient of variation (CV) | 0.00017707405 |
| Kurtosis | 1.3647377 |
| Mean | 200923.57 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.7557016 |
| Sum | 1.0046178 × 10^10 |
| Variance | 1265.819 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200909 | 5282 | 10.6% |
| 200905 | 4811 | 9.6% |
| 200904 | 4471 | 8.9% |
| 200908 | 4249 | 8.5% |
| 200907 | 4247 | 8.5% |
| 201002 | 4181 | 8.4% |
| 201001 | 4137 | 8.3% |
| 200906 | 4054 | 8.1% |
| 200912 | 4015 | 8.0% |
| 200910 | 3773 | 7.5% |
| Other values (16) | 6780 | 13.6% |
| Value | Count | Frequency (%) |
| 200902 | 47 | 0.1% |
| 200903 | 3159 | 6.3% |
| 200904 | 4471 | 8.9% |
| 200905 | 4811 | 9.6% |
| 200906 | 4054 | 8.1% |
| 200907 | 4247 | 8.5% |
| 200908 | 4249 | 8.5% |
| 200909 | 5282 | 10.6% |
| 200910 | 3773 | 7.5% |
| 200911 | 3347 | 6.7% |
| Value | Count | Frequency (%) |
| 201310 | 1 | < 0.1% |
| 201209 | 1 | < 0.1% |
| 201107 | 1 | < 0.1% |
| 201012 | 1 | < 0.1% |
| 201011 | 1 | < 0.1% |
| 201010 | 4 | < 0.1% |
| 201009 | 6 | < 0.1% |
| 201008 | 9 | < 0.1% |
| 201007 | 5 | < 0.1% |
| 201006 | 8 | < 0.1% |
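first_pay is profiled as a real number, but its values (200902 to 201310) read as year-month codes in YYYYMM form. Under that assumption, numeric summaries such as the mean (200923.57) or the range (408) are not meaningful as dates. A minimal sketch of decoding the integers first (the helper name is illustrative):

```python
from datetime import date

def parse_yyyymm(code: int) -> date:
    """Decode an integer such as 200902 into the first day of that month."""
    year, month = divmod(code, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a YYYYMM code: {code}")
    return date(year, month, 1)

earliest = parse_yyyymm(200902)   # reported minimum
latest = parse_yyyymm(201310)     # reported maximum
print(earliest.isoformat(), latest.isoformat())  # prints 2009-02-01 2013-10-01
```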
fha
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Value | Count |
|---|---|
| N | 46758 |
| Y | 3238 |
| 9 | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N |
|---|---|
| 2nd row | Y |
| 3rd row | N |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 46758 | 93.5% |
| Y | 3238 | 6.5% |
| 9 | 4 | < 0.1% |
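The 78.1% imbalance score in the alert can be reproduced from the counts above as one minus the normalized Shannon entropy of the class distribution. That formula is inferred from the numbers in this report rather than stated in it, so treat it as an assumption about how the score is computed:

```python
import math

def imbalance_score(counts):
    """1 - H(p) / log(k): 0 for a uniform split, close to 1 for a dominant class."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 1.0  # single class: treated as fully imbalanced (a choice made here)
    entropy = -sum(p * math.log(p) for p in probs)
    return 1 - entropy / math.log(len(probs))

fha_counts = [46758, 3238, 4]  # N, Y, 9
print(f"{imbalance_score(fha_counts):.1%}")  # prints 78.1%
```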
Most occurring characters
| Value | Count | Frequency (%) |
| N | 46758 | |
| Y | 3238 | 6.5% |
| 9 | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49996 | |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 46758 | |
| Y | 3238 | 6.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49996 | |
| Common | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 46758 | |
| Y | 3238 | 6.5% |
Common
| Value | Count | Frequency (%) |
| 9 | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 46758 | |
| Y | 3238 | 6.5% |
| 9 | 4 | < 0.1% |
mat_date
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 207 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 203587.62 |
| Minimum | 201411 |
| Maximum | 204011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 201411 |
|---|---|
| 5-th percentile | 202405 |
| Q1 | 203902 |
| median | 203906 |
| Q3 | 203909 |
| 95-th percentile | 204001 |
| Maximum | 204011 |
| Range | 2600 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 609.80327 |
|---|---|
| Coefficient of variation (CV) | 0.0029952866 |
| Kurtosis | 0.14700177 |
| Mean | 203587.62 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -1.4070794 |
| Sum | 1.0179381 × 10^10 |
| Variance | 371860.03 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 203908 | 4056 | 8.1% |
| 203904 | 3797 | 7.6% |
| 203903 | 3490 | 7.0% |
| 203906 | 3225 | 6.5% |
| 203907 | 3221 | 6.4% |
| 203905 | 3135 | 6.3% |
| 203912 | 3023 | 6.0% |
| 203911 | 3017 | 6.0% |
| 204001 | 2993 | 6.0% |
| 203909 | 2908 | 5.8% |
| Other values (197) | 17135 | 34.3% |
| Value | Count | Frequency (%) |
| 201411 | 1 | < 0.1% |
| 201710 | 1 | < 0.1% |
| 201712 | 1 | < 0.1% |
| 201902 | 19 | < 0.1% |
| 201903 | 37 | 0.1% |
| 201904 | 43 | 0.1% |
| 201905 | 35 | 0.1% |
| 201906 | 39 | 0.1% |
| 201907 | 30 | 0.1% |
| 201908 | 53 | 0.1% |
| Value | Count | Frequency (%) |
| 204011 | 1 | < 0.1% |
| 204010 | 1 | < 0.1% |
| 204009 | 3 | < 0.1% |
| 204008 | 2 | < 0.1% |
| 204007 | 8 | < 0.1% |
| 204006 | 5 | < 0.1% |
| 204005 | 7 | < 0.1% |
| 204004 | 3 | < 0.1% |
| 204003 | 8 | < 0.1% |
| 204002 | 109 | 0.2% |
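The alert linking mat_date to first_pay and orig_loan_term is what one would expect if both date columns are YYYYMM codes and the term counts monthly payments: the maturity month then follows from the first payment month and the term. Both points are assumptions, as is the convention that the first payment counts as payment 1. A sketch of the month arithmetic, which also shows why subtracting the raw integers (203908 − 200909 = 2999) is not a usable measure:

```python
def months_between(start_yyyymm: int, end_yyyymm: int) -> int:
    """Whole months from one YYYYMM code to another."""
    y1, m1 = divmod(start_yyyymm, 100)
    y2, m2 = divmod(end_yyyymm, 100)
    return (y2 - y1) * 12 + (m2 - m1)

# Most common codes in each column of the report (not necessarily the same loans).
first_pay, mat_date = 200909, 203908
implied_term = months_between(first_pay, mat_date) + 1  # count the first payment
print(implied_term)  # prints 360
```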
msa
Real number (ℝ)
MISSING 
| Distinct | 430 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 9256 |
| Missing (%) | 18.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30225.759 |
| Minimum | 10180 |
| Maximum | 49740 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 10180 |
|---|---|
| 5-th percentile | 12420 |
| Q1 | 19124 |
| median | 31700 |
| Q3 | 40060 |
| 95-th percentile | 47644 |
| Maximum | 49740 |
| Range | 39560 |
| Interquartile range (IQR) | 20936 |
Descriptive statistics
| Standard deviation | 11333.055 |
|---|---|
| Coefficient of variation (CV) | 0.37494692 |
| Kurtosis | -1.2806112 |
| Mean | 30225.759 |
| Median Absolute Deviation (MAD) | 9920 |
| Skewness | -0.16199969 |
| Sum | 1.2315183 × 10^9 |
| Variance | 1.2843814 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16974 | 1585 | 3.2% |
| 31084 | 1090 | 2.2% |
| 33460 | 869 | 1.7% |
| 47894 | 816 | 1.6% |
| 12060 | 814 | 1.6% |
| 41180 | 717 | 1.4% |
| 42644 | 663 | 1.3% |
| 19740 | 646 | 1.3% |
| 38060 | 627 | 1.3% |
| 38900 | 624 | 1.2% |
| Other values (420) | 32293 | 64.6% |
| (Missing) | 9256 | 18.5% |
| Value | Count | Frequency (%) |
| 10180 | 8 | < 0.1% |
| 10420 | 98 | 0.2% |
| 10500 | 11 | < 0.1% |
| 10540 | 3 | < 0.1% |
| 10580 | 122 | 0.2% |
| 10740 | 163 | 0.3% |
| 10780 | 3 | < 0.1% |
| 10900 | 155 | 0.3% |
| 11020 | 15 | < 0.1% |
| 11100 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 49740 | 3 | < 0.1% |
| 49700 | 15 | < 0.1% |
| 49660 | 34 | 0.1% |
| 49620 | 88 | 0.2% |
| 49420 | 23 | < 0.1% |
| 49340 | 173 | 0.3% |
| 49180 | 107 | 0.2% |
| 49020 | 22 | < 0.1% |
| 48900 | 108 | 0.2% |
| 48864 | 121 | 0.2% |
mortgage_ins_pct
Real number (ℝ)
ZEROS 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.55514 |
| Minimum | 0 |
| Maximum | 35 |
| Zeros | 46511 |
| Zeros (%) | 93.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 17 |
| Maximum | 35 |
| Range | 35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.96548 |
|---|---|
| Coefficient of variation (CV) | 3.8359762 |
| Kurtosis | 13.438167 |
| Mean | 1.55514 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.8325425 |
| Sum | 77757 |
| Variance | 35.586951 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 46511 | 93.0% |
| 25 | 1487 | 3.0% |
| 30 | 732 | 1.5% |
| 12 | 662 | 1.3% |
| 17 | 412 | 0.8% |
| 6 | 79 | 0.2% |
| 35 | 59 | 0.1% |
| 20 | 42 | 0.1% |
| 18 | 9 | < 0.1% |
| 21 | 2 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 46511 | 93.0% |
| 6 | 79 | 0.2% |
| 9 | 1 | < 0.1% |
| 12 | 662 | 1.3% |
| 15 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 412 | 0.8% |
| 18 | 9 | < 0.1% |
| 19 | 1 | < 0.1% |
| 20 | 42 | 0.1% |
| Value | Count | Frequency (%) |
| 35 | 59 | 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 732 | 1.5% |
| 25 | 1487 | 3.0% |
| 21 | 2 | < 0.1% |
| 20 | 42 | 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 9 | < 0.1% |
| 17 | 412 | 0.8% |
| 16 | 1 | < 0.1% |
unit_num
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Value | Count |
|---|---|
| 1 | 49421 |
| 2 | 400 |
| 4 | 105 |
| 3 | 74 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 49421 | 98.8% |
| 2 | 400 | 0.8% |
| 4 | 105 | 0.2% |
| 3 | 74 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 49421 | |
| 2 | 400 | 0.8% |
| 4 | 105 | 0.2% |
| 3 | 74 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 49421 | |
| 2 | 400 | 0.8% |
| 4 | 105 | 0.2% |
| 3 | 74 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 49421 | |
| 2 | 400 | 0.8% |
| 4 | 105 | 0.2% |
| 3 | 74 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 49421 | |
| 2 | 400 | 0.8% |
| 4 | 105 | 0.2% |
| 3 | 74 | 0.1% |
occupancy
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Value | Count |
|---|---|
| P | 46146 |
| S | 2216 |
| I | 1638 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P |
|---|---|
| 2nd row | P |
| 3rd row | P |
| 4th row | P |
| 5th row | P |
Common Values
| Value | Count | Frequency (%) |
| P | 46146 | 92.3% |
| S | 2216 | 4.4% |
| I | 1638 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 46146 | |
| S | 2216 | 4.4% |
| I | 1638 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 50000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 46146 | |
| S | 2216 | 4.4% |
| I | 1638 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 46146 | |
| S | 2216 | 4.4% |
| I | 1638 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 46146 | |
| S | 2216 | 4.4% |
| I | 1638 | 3.3% |
orig_cltv
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 149 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.36696 |
| Minimum | 5 |
| Maximum | 212 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 57 |
| median | 73 |
| Q3 | 80 |
| 95-th percentile | 95 |
| Maximum | 212 |
| Range | 207 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 18.340819 |
|---|---|
| Coefficient of variation (CV) | 0.26827021 |
| Kurtosis | 0.62136391 |
| Mean | 68.36696 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.41936005 |
| Sum | 3418348 |
| Variance | 336.38563 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 8719 | 17.4% |
| 75 | 2420 | 4.8% |
| 90 | 1701 | 3.4% |
| 70 | 1300 | 2.6% |
| 60 | 1201 | 2.4% |
| 79 | 1170 | 2.3% |
| 74 | 1063 | 2.1% |
| 95 | 1029 | 2.1% |
| 78 | 1016 | 2.0% |
| 73 | 975 | 1.9% |
| Other values (139) | 29406 | 58.8% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 10 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 10 | < 0.1% |
| 11 | 7 | < 0.1% |
| 12 | 19 | < 0.1% |
| 13 | 19 | < 0.1% |
| 14 | 30 | 0.1% |
| 15 | 42 | 0.1% |
| Value | Count | Frequency (%) |
| 212 | 1 | < 0.1% |
| 206 | 1 | < 0.1% |
| 193 | 1 | < 0.1% |
| 183 | 2 | < 0.1% |
| 181 | 1 | < 0.1% |
| 176 | 1 | < 0.1% |
| 173 | 1 | < 0.1% |
| 172 | 1 | < 0.1% |
| 169 | 1 | < 0.1% |
| 151 | 1 | < 0.1% |
orig_dti
Real number (ℝ)
| Distinct | 66 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 136.65418 |
| Minimum | 1 |
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 24 |
| median | 33 |
| Q3 | 44 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 998 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 300.98115 |
|---|---|
| Coefficient of variation (CV) | 2.2025023 |
| Kurtosis | 4.3236369 |
| Mean | 136.65418 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 2.5119297 |
| Sum | 6832709 |
| Variance | 90589.651 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 999 | 5423 | 10.8% |
| 26 | 1438 | 2.9% |
| 24 | 1352 | 2.7% |
| 28 | 1348 | 2.7% |
| 27 | 1344 | 2.7% |
| 31 | 1343 | 2.7% |
| 25 | 1332 | 2.7% |
| 29 | 1326 | 2.7% |
| 30 | 1324 | 2.6% |
| 33 | 1313 | 2.6% |
| Other values (56) | 32457 | 64.9% |
| Value | Count | Frequency (%) |
| 1 | 5 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 18 | < 0.1% |
| 4 | 27 | 0.1% |
| 5 | 52 | 0.1% |
| 6 | 60 | 0.1% |
| 7 | 100 | 0.2% |
| 8 | 144 | 0.3% |
| 9 | 229 | 0.5% |
| 10 | 250 | 0.5% |
| Value | Count | Frequency (%) |
| 999 | 5423 | 10.8% |
| 65 | 21 | < 0.1% |
| 64 | 38 | 0.1% |
| 63 | 35 | 0.1% |
| 62 | 46 | 0.1% |
| 61 | 39 | 0.1% |
| 60 | 40 | 0.1% |
| 59 | 63 | 0.1% |
| 58 | 57 | 0.1% |
| 57 | 72 | 0.1% |
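The value 999 accounts for 5423 rows (10.8%) and sits far above the next-highest value of 65, which is why the 95th percentile equals the maximum. If 999 is a placeholder for an unavailable ratio (an assumption; the report does not define it), its effect on the mean can be backed out from the reported sum and counts alone:

```python
n_rows = 50_000
total_sum = 6_832_709            # "Sum" from the descriptive statistics
sentinel, sentinel_rows = 999, 5_423

valid_rows = n_rows - sentinel_rows                 # 44,577 rows remain
valid_sum = total_sum - sentinel * sentinel_rows    # 1,415,132

print(f"mean with 999:    {total_sum / n_rows:.2f}")      # prints 136.65
print(f"mean without 999: {valid_sum / valid_rows:.2f}")  # prints 31.75
```

The adjusted mean sits close to the reported median of 33, which is consistent with 999 being a code rather than a measured ratio.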
orig_upb
Real number (ℝ)
| Distinct | 670 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 212505.54 |
| Minimum | 8000 |
| Maximum | 790000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 8000 |
|---|---|
| 5-th percentile | 68000 |
| Q1 | 123000 |
| median | 188000 |
| Q3 | 280000 |
| 95-th percentile | 417000 |
| Maximum | 790000 |
| Range | 782000 |
| Interquartile range (IQR) | 157000 |
Descriptive statistics
| Standard deviation | 115814.41 |
|---|---|
| Coefficient of variation (CV) | 0.54499479 |
| Kurtosis | 1.111691 |
| Mean | 212505.54 |
| Median Absolute Deviation (MAD) | 74000 |
| Skewness | 0.98875167 |
| Sum | 1.0625277 × 10^10 |
| Variance | 1.3412978 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 417000 | 1993 | 4.0% |
| 100000 | 615 | 1.2% |
| 200000 | 614 | 1.2% |
| 150000 | 483 | 1.0% |
| 300000 | 413 | 0.8% |
| 120000 | 409 | 0.8% |
| 140000 | 392 | 0.8% |
| 160000 | 379 | 0.8% |
| 180000 | 368 | 0.7% |
| 250000 | 351 | 0.7% |
| Other values (660) | 43983 | 88.0% |
| Value | Count | Frequency (%) |
| 8000 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 13000 | 1 | < 0.1% |
| 15000 | 1 | < 0.1% |
| 16000 | 1 | < 0.1% |
| 17000 | 3 | < 0.1% |
| 18000 | 2 | < 0.1% |
| 19000 | 3 | < 0.1% |
| 20000 | 2 | < 0.1% |
| 21000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 790000 | 1 | < 0.1% |
| 788000 | 1 | < 0.1% |
| 776000 | 1 | < 0.1% |
| 730000 | 66 | 0.1% |
| 729000 | 19 | < 0.1% |
| 728000 | 5 | < 0.1% |
| 726000 | 1 | < 0.1% |
| 725000 | 3 | < 0.1% |
| 722000 | 3 | < 0.1% |
| 721000 | 2 | < 0.1% |
orig_ltv
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 121 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.5119 |
| Minimum | 5 |
| Maximum | 125 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 32 |
| Q1 | 55 |
| median | 71 |
| Q3 | 80 |
| 95-th percentile | 90 |
| Maximum | 125 |
| Range | 120 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 17.695917 |
|---|---|
| Coefficient of variation (CV) | 0.26605641 |
| Kurtosis | -0.019401098 |
| Mean | 66.5119 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.63517569 |
| Sum | 3325595 |
| Variance | 313.14548 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 9161 | 18.3% |
| 75 | 2472 | 4.9% |
| 70 | 1320 | 2.6% |
| 60 | 1292 | 2.6% |
| 90 | 1284 | 2.6% |
| 79 | 1206 | 2.4% |
| 74 | 1089 | 2.2% |
| 78 | 1033 | 2.1% |
| 73 | 979 | 2.0% |
| 69 | 926 | 1.9% |
| Other values (111) | 29238 | 58.5% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 10 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 13 | < 0.1% |
| 11 | 9 | < 0.1% |
| 12 | 21 | < 0.1% |
| 13 | 20 | < 0.1% |
| 14 | 36 | 0.1% |
| Value | Count | Frequency (%) |
| 125 | 3 | < 0.1% |
| 124 | 3 | < 0.1% |
| 123 | 2 | < 0.1% |
| 122 | 4 | < 0.1% |
| 121 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 119 | 2 | < 0.1% |
| 118 | 1 | < 0.1% |
| 117 | 3 | < 0.1% |
| 116 | 3 | < 0.1% |
orig_int
Real number (ℝ)
| Distinct | 260 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9782446 |
| Minimum | 3.5 |
| Maximum | 7.875 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 3.5 |
|---|---|
| 5-th percentile | 4.375 |
| Q1 | 4.75 |
| median | 4.875 |
| Q3 | 5.25 |
| 95-th percentile | 5.625 |
| Maximum | 7.875 |
| Range | 4.375 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.38970043 |
|---|---|
| Coefficient of variation (CV) | 0.07828069 |
| Kurtosis | 1.2562636 |
| Mean | 4.9782446 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | 0.69814072 |
| Sum | 248912.23 |
| Variance | 0.15186642 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.875 | 9112 | 18.2% |
| 4.75 | 7781 | 15.6% |
| 5 | 4970 | 9.9% |
| 5.25 | 4789 | 9.6% |
| 5.125 | 3764 | 7.5% |
| 5.375 | 3601 | 7.2% |
| 4.5 | 2985 | 6.0% |
| 4.625 | 2942 | 5.9% |
| 5.5 | 2289 | 4.6% |
| 4.375 | 1917 | 3.8% |
| Other values (250) | 5850 | 11.7% |
| Value | Count | Frequency (%) |
| 3.5 | 1 | < 0.1% |
| 3.75 | 2 | < 0.1% |
| 3.875 | 3 | < 0.1% |
| 4 | 9 | < 0.1% |
| 4.125 | 11 | < 0.1% |
| 4.25 | 1532 | 3.1% |
| 4.26 | 2 | < 0.1% |
| 4.27 | 1 | < 0.1% |
| 4.272 | 1 | < 0.1% |
| 4.275 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7.875 | 1 | < 0.1% |
| 7.5 | 2 | < 0.1% |
| 7.375 | 1 | < 0.1% |
| 7.25 | 2 | < 0.1% |
| 7.125 | 3 | < 0.1% |
| 7 | 5 | < 0.1% |
| 6.875 | 14 | < 0.1% |
| 6.75 | 18 | < 0.1% |
| 6.625 | 40 | 0.1% |
| 6.5 | 90 | 0.2% |
channel
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Value | Count |
|---|---|
| R | 29610 |
| C | 12415 |
| B | 7975 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | R |
|---|---|
| 2nd row | R |
| 3rd row | R |
| 4th row | R |
| 5th row | R |
Common Values
| Value | Count | Frequency (%) |
| R | 29610 | 59.2% |
| C | 12415 | 24.8% |
| B | 7975 | 16.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 29610 | |
| C | 12415 | |
| B | 7975 | 16.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 50000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 29610 | |
| C | 12415 | |
| B | 7975 | 16.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 29610 | |
| C | 12415 | |
| B | 7975 | 16.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 29610 | |
| C | 12415 | |
| B | 7975 | 16.0% |
ppm
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.0 KiB |
| Value | Count |
|---|---|
| False | 50000 |
| Value | Count | Frequency (%) |
| False | 50000 | 100.0% |
amort
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Value | Count |
|---|---|
| FRM | 50000 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 150000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FRM |
|---|---|
| 2nd row | FRM |
| 3rd row | FRM |
| 4th row | FRM |
| 5th row | FRM |
Common Values
| Value | Count | Frequency (%) |
| FRM | 50000 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 50000 | |
| R | 50000 | |
| M | 50000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 150000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 50000 | |
| R | 50000 | |
| M | 50000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 150000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 50000 | |
| R | 50000 | |
| M | 50000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 150000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 50000 | |
| R | 50000 | |
| M | 50000 |
prop_state
Text
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 100000 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MN |
|---|---|
| 2nd row | NY |
| 3rd row | WA |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ca | 5791 | 11.6% |
| il | 3072 | 6.1% |
| tx | 2164 | 4.3% |
| nc | 1911 | 3.8% |
| oh | 1902 | 3.8% |
| ny | 1889 | 3.8% |
| pa | 1870 | 3.7% |
| wi | 1698 | 3.4% |
| wa | 1672 | 3.3% |
| fl | 1627 | 3.3% |
| Other values (44) | 26404 | 52.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 16668 | |
| C | 10544 | |
| N | 10410 | |
| I | 8978 | 9.0% |
| M | 7893 | 7.9% |
| O | 5943 | 5.9% |
| L | 5574 | 5.6% |
| T | 4786 | 4.8% |
| W | 3628 | 3.6% |
| Y | 2814 | 2.8% |
| Other values (14) | 22762 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 100000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 16668 | |
| C | 10544 | |
| N | 10410 | |
| I | 8978 | 9.0% |
| M | 7893 | 7.9% |
| O | 5943 | 5.9% |
| L | 5574 | 5.6% |
| T | 4786 | 4.8% |
| W | 3628 | 3.6% |
| Y | 2814 | 2.8% |
| Other values (14) | 22762 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 16668 | |
| C | 10544 | |
| N | 10410 | |
| I | 8978 | 9.0% |
| M | 7893 | 7.9% |
| O | 5943 | 5.9% |
| L | 5574 | 5.6% |
| T | 4786 | 4.8% |
| W | 3628 | 3.6% |
| Y | 2814 | 2.8% |
| Other values (14) | 22762 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 100000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 16668 | |
| C | 10544 | |
| N | 10410 | |
| I | 8978 | 9.0% |
| M | 7893 | 7.9% |
| O | 5943 | 5.9% |
| L | 5574 | 5.6% |
| T | 4786 | 4.8% |
| W | 3628 | 3.6% |
| Y | 2814 | 2.8% |
| Other values (14) | 22762 |
prop_type
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Value | Count |
|---|---|
| SF | 37242 |
| PU | 9699 |
| CO | 2795 |
| CP | 143 |
| MH | 121 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 100000 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SF |
|---|---|
| 2nd row | SF |
| 3rd row | SF |
| 4th row | PU |
| 5th row | SF |
Common Values
| Value | Count | Frequency (%) |
| SF | 37242 | 74.5% |
| PU | 9699 | 19.4% |
| CO | 2795 | 5.6% |
| CP | 143 | 0.3% |
| MH | 121 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 37242 | |
| F | 37242 | |
| P | 9842 | 9.8% |
| U | 9699 | 9.7% |
| C | 2938 | 2.9% |
| O | 2795 | 2.8% |
| M | 121 | 0.1% |
| H | 121 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 100000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 37242 | |
| F | 37242 | |
| P | 9842 | 9.8% |
| U | 9699 | 9.7% |
| C | 2938 | 2.9% |
| O | 2795 | 2.8% |
| M | 121 | 0.1% |
| H | 121 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 37242 | |
| F | 37242 | |
| P | 9842 | 9.8% |
| U | 9699 | 9.7% |
| C | 2938 | 2.9% |
| O | 2795 | 2.8% |
| M | 121 | 0.1% |
| H | 121 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 100000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 37242 | |
| F | 37242 | |
| P | 9842 | 9.8% |
| U | 9699 | 9.7% |
| C | 2938 | 2.9% |
| O | 2795 | 2.8% |
| M | 121 | 0.1% |
| H | 121 | 0.1% |
prop_zip
Real number (ℝ)
| Distinct | 875 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52283.858 |
| Minimum | 600 |
| Maximum | 99900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 600 |
|---|---|
| 5-th percentile | 5400 |
| Q1 | 27700 |
| median | 53000 |
| Q3 | 80200 |
| 95-th percentile | 97100 |
| Maximum | 99900 |
| Range | 99300 |
| Interquartile range (IQR) | 52500 |
Descriptive statistics
| Standard deviation | 29933.93 |
|---|---|
| Coefficient of variation (CV) | 0.57252718 |
| Kurtosis | -1.2294483 |
| Mean | 52283.858 |
| Median Absolute Deviation (MAD) | 25700 |
| Skewness | 4.8581779 × 10^-6 |
| Sum | 2.6141929 × 10^9 |
| Variance | 8.9604016 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 94500 | 565 | 1.1% |
| 60000 | 507 | 1.0% |
| 84000 | 436 | 0.9% |
| 60600 | 432 | 0.9% |
| 60100 | 423 | 0.8% |
| 75000 | 412 | 0.8% |
| 98000 | 409 | 0.8% |
| 30000 | 408 | 0.8% |
| 60500 | 383 | 0.8% |
| 92600 | 325 | 0.7% |
| Other values (865) | 45700 | 91.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 600 | 12 | < 0.1% |
| 700 | 23 | < 0.1% |
| 800 | 3 | < 0.1% |
| 900 | 32 | 0.1% |
| 1000 | 79 | 0.2% |
| 1100 | 13 | < 0.1% |
| 1200 | 16 | < 0.1% |
| 1300 | 16 | < 0.1% |
| 1400 | 52 | 0.1% |
| 1500 | 117 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 99900 | 2 | < 0.1% |
| 99800 | 11 | < 0.1% |
| 99700 | 7 | < 0.1% |
| 99600 | 35 | 0.1% |
| 99500 | 90 | 0.2% |
| 99400 | 5 | < 0.1% |
| 99300 | 55 | 0.1% |
| 99200 | 82 | 0.2% |
| 99100 | 14 | < 0.1% |
| 99000 | 38 | 0.1% |
loan_id
Text
UNIQUE 
| Distinct | 50000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 600000 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 50000 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | F09Q10000013 |
|---|---|
| 2nd row | F09Q10000078 |
| 3rd row | F09Q10000148 |
| 4th row | F09Q10000154 |
| 5th row | F09Q10000180 |
Words
| Value | Count | Frequency (%) |
| f09q10000013 | 1 | < 0.1% |
| f09q10002187 | 1 | < 0.1% |
| f09q10000787 | 1 | < 0.1% |
| f09q10000556 | 1 | < 0.1% |
| f09q10000148 | 1 | < 0.1% |
| f09q10000154 | 1 | < 0.1% |
| f09q10000180 | 1 | < 0.1% |
| f09q10000181 | 1 | < 0.1% |
| f09q10000183 | 1 | < 0.1% |
| f09q10000216 | 1 | < 0.1% |
| Other values (49990) | 49990 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 134574 | 22.4% |
| 9 | 74766 | 12.5% |
| F | 50000 | 8.3% |
| Q | 50000 | 8.3% |
| 1 | 47400 | 7.9% |
| 2 | 47157 | 7.9% |
| 3 | 47059 | 7.8% |
| 4 | 44493 | 7.4% |
| 5 | 28290 | 4.7% |
| 6 | 26704 | 4.5% |
| Other values (2) | 49557 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 500000 | 83.3% |
| Uppercase Letter | 100000 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 134574 | 26.9% |
| 9 | 74766 | 15.0% |
| 1 | 47400 | 9.5% |
| 2 | 47157 | 9.4% |
| 3 | 47059 | 9.4% |
| 4 | 44493 | 8.9% |
| 5 | 28290 | 5.7% |
| 6 | 26704 | 5.3% |
| 7 | 25011 | 5.0% |
| 8 | 24546 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 50000 | 50.0% |
| Q | 50000 | 50.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 500000 | 83.3% |
| Latin | 100000 | 16.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 134574 | 26.9% |
| 9 | 74766 | 15.0% |
| 1 | 47400 | 9.5% |
| 2 | 47157 | 9.4% |
| 3 | 47059 | 9.4% |
| 4 | 44493 | 8.9% |
| 5 | 28290 | 5.7% |
| 6 | 26704 | 5.3% |
| 7 | 25011 | 5.0% |
| 8 | 24546 | 4.9% |
Latin
| Value | Count | Frequency (%) |
| F | 50000 | 50.0% |
| Q | 50000 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 600000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 134574 | 22.4% |
| 9 | 74766 | 12.5% |
| F | 50000 | 8.3% |
| Q | 50000 | 8.3% |
| 1 | 47400 | 7.9% |
| 2 | 47157 | 7.9% |
| 3 | 47059 | 7.8% |
| 4 | 44493 | 7.4% |
| 5 | 28290 | 4.7% |
| 6 | 26704 | 4.5% |
| Other values (2) | 49557 | 8.3% |
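The loan_id samples share one shape: 12 characters, a leading letter, two digits, a `Q`, and eight more digits. A minimal sketch of splitting that shape into parts. The interpretation of the pieces (a two-digit year, a quarter, and a seven-digit sequence number) is an assumption inferred from the samples, and `split_loan_id` is a hypothetical helper, not something the report provides.

```python
import re

# Assumed layout, inferred from the samples: a letter, a two-digit year,
# "Q", a one-digit quarter, and a seven-digit sequence number.
LOAN_ID = re.compile(r"^([A-Z])(\d{2})Q(\d)(\d{7})$")


def split_loan_id(loan_id: str) -> dict:
    """Split an identifier such as 'F09Q10000013' into its assumed parts."""
    match = LOAN_ID.match(loan_id)
    if match is None:
        raise ValueError(f"unexpected loan_id format: {loan_id!r}")
    prefix, year, quarter, sequence = match.groups()
    return {
        "prefix": prefix,
        "year": year,
        "quarter": int(quarter),
        "sequence": sequence,
    }


print(split_loan_id("F09Q10000013"))
```

Because every loan_id is unique, the raw column is an identifier rather than a feature; any derived parts would be the only pieces worth profiling.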
loan_purp
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| N | 25554 |
|---|---|
| C | 14123 |
| P | 10323 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | P |
| 3rd row | C |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 25554 | 51.1% |
| C | 14123 | 28.2% |
| P | 10323 | 20.6% |
Words
| Value | Count | Frequency (%) |
| n | 25554 | 51.1% |
| c | 14123 | 28.2% |
| p | 10323 | 20.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 25554 | 51.1% |
| C | 14123 | 28.2% |
| P | 10323 | 20.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 50000 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 25554 | 51.1% |
| C | 14123 | 28.2% |
| P | 10323 | 20.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50000 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 25554 | 51.1% |
| C | 14123 | 28.2% |
| P | 10323 | 20.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 25554 | 51.1% |
| C | 14123 | 28.2% |
| P | 10323 | 20.6% |
orig_loan_term
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 121 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 320.68482 |
| Minimum | 60 |
|---|---|
| Maximum | 360 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 180 |
| Q1 | 360 |
| median | 360 |
| Q3 | 360 |
| 95-th percentile | 360 |
| Maximum | 360 |
| Range | 300 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 73.252147 |
|---|---|
| Coefficient of variation (CV) | 0.22842412 |
| Kurtosis | 0.14735424 |
| Mean | 320.68482 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.4113492 |
| Sum | 16034241 |
| Variance | 5365.877 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 360 | 38238 | 76.5% |
| 180 | 8806 | 17.6% |
| 240 | 1637 | 3.3% |
| 120 | 488 | 1.0% |
| 300 | 431 | 0.9% |
| 144 | 29 | 0.1% |
| 324 | 23 | < 0.1% |
| 336 | 22 | < 0.1% |
| 168 | 18 | < 0.1% |
| 156 | 18 | < 0.1% |
| Other values (111) | 290 | 0.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 60 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 101 | 1 | < 0.1% |
| 119 | 1 | < 0.1% |
| 120 | 488 | 1.0% |
| 121 | 14 | < 0.1% |
| 130 | 1 | < 0.1% |
| 131 | 1 | < 0.1% |
| 132 | 8 | < 0.1% |
| 135 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 360 | 38238 | 76.5% |
| 359 | 5 | < 0.1% |
| 358 | 2 | < 0.1% |
| 357 | 2 | < 0.1% |
| 356 | 4 | < 0.1% |
| 355 | 2 | < 0.1% |
| 354 | 4 | < 0.1% |
| 353 | 4 | < 0.1% |
| 352 | 1 | < 0.1% |
| 351 | 2 | < 0.1% |
borrower_num
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 2 | 30704 |
|---|---|
| 1 | 19290 |
| 99 | 6 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.00012 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50006 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 30704 | 61.4% |
| 1 | 19290 | 38.6% |
| 99 | 6 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 2 | 30704 | 61.4% |
| 1 | 19290 | 38.6% |
| 99 | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 30704 | 61.4% |
| 1 | 19290 | 38.6% |
| 9 | 12 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50006 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 30704 | 61.4% |
| 1 | 19290 | 38.6% |
| 9 | 12 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50006 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 30704 | 61.4% |
| 1 | 19290 | 38.6% |
| 9 | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50006 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 30704 | 61.4% |
| 1 | 19290 | 38.6% |
| 9 | 12 | < 0.1% |
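borrower_num takes the values 1 and 2 plus a rare 99 (6 rows), and it is typed as categorical even though it reads like a count. A minimal sketch of one way to handle that, assuming 99 is a "not available" code rather than a real number of borrowers; the report itself does not say what 99 means.

```python
import pandas as pd

# Toy column holding the three values the report lists for borrower_num.
borrower_num = pd.Series(["2", "1", "1", "99", "2"], name="borrower_num")

# Parse to a nullable integer type so that missing values can be represented.
as_int = pd.to_numeric(borrower_num).astype("Int64")

# Assumption: 99 is a sentinel for "not available", so mask it out.
cleaned = as_int.mask(as_int == 99)

print(cleaned)
```

If 99 turns out to be a genuine value, the masking step should be dropped; the nullable integer conversion is still useful on its own.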
seller
Categorical
HIGH CORRELATION 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| WELLS FARGO BANK, N.A. | 13171 |
|---|---|
| Other sellers | 11972 |
| BANK OF AMERICA, N.A. | 4522 |
| U.S. BANK N.A. | 3762 |
| CHASE HOME FINANCE LLC | 3580 |
| Other values (12) | 12993 |
Length
| Max length | 52 |
|---|---|
| Median length | 38 |
| Mean length | 20.50626 |
| Min length | 12 |
Characters and Unicode
| Total characters | 1025313 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other sellers |
|---|---|
| 2nd row | Other sellers |
| 3rd row | Other sellers |
| 4th row | Other sellers |
| 5th row | Other sellers |
Common Values
| Value | Count | Frequency (%) |
| WELLS FARGO BANK, N.A. | 13171 | 26.3% |
| Other sellers | 11972 | 23.9% |
| BANK OF AMERICA, N.A. | 4522 | 9.0% |
| U.S. BANK N.A. | 3762 | 7.5% |
| CHASE HOME FINANCE LLC | 3580 | 7.2% |
| BRANCH BANKING & TRUST COMPANY | 2261 | 4.5% |
| PROVIDENT FUNDING ASSOCIATES, L.P. | 1894 | 3.8% |
| CITIMORTGAGE, INC. | 1655 | 3.3% |
| FIFTH THIRD BANK | 1541 | 3.1% |
| METLIFE HOME LOANS, A DIVISION OF METLIFE BANK, N.A. | 1440 | 2.9% |
| Other values (7) | 4202 | 8.4% |
Words
| Value | Count | Frequency (%) |
| bank | 25790 | 14.8% |
| n.a | 22895 | 13.1% |
| wells | 13171 | 7.6% |
| fargo | 13171 | 7.6% |
| other | 11972 | 6.9% |
| sellers | 11972 | 6.9% |
| of | 5962 | 3.4% |
| home | 5020 | 2.9% |
| llc | 4576 | 2.6% |
| america | 4522 | 2.6% |
| Other values (34) | 55132 | 31.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 124183 | 12.1% |
| A | 101402 | 9.9% |
| N | 78263 | 7.6% |
| . | 60210 | 5.9% |
| O | 52422 | 5.1% |
| E | 45396 | 4.4% |
| L | 42875 | 4.2% |
| e | 35916 | 3.5% |
| R | 35034 | 3.4% |
| S | 34751 | 3.4% |
| Other values (22) | 414861 | 40.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 679229 | 66.2% |
| Lowercase Letter | 131692 | 12.8% |
| Space Separator | 124183 | 12.1% |
| Other Punctuation | 90209 | 8.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 101402 | 14.9% |
| N | 78263 | 11.5% |
| O | 52422 | 7.7% |
| E | 45396 | 6.7% |
| L | 42875 | 6.3% |
| R | 35034 | 5.2% |
| S | 34751 | 5.1% |
| I | 33536 | 4.9% |
| B | 31479 | 4.6% |
| F | 31431 | 4.6% |
| Other values (12) | 192640 | 28.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 35916 | 27.3% |
| r | 23944 | 18.2% |
| s | 23944 | 18.2% |
| l | 23944 | 18.2% |
| h | 11972 | 9.1% |
| t | 11972 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 60210 | 66.7% |
| , | 27002 | 29.9% |
| & | 2997 | 3.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 124183 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 810921 | 79.1% |
| Common | 214392 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 101402 | 12.5% |
| N | 78263 | 9.7% |
| O | 52422 | 6.5% |
| E | 45396 | 5.6% |
| L | 42875 | 5.3% |
| e | 35916 | 4.4% |
| R | 35034 | 4.3% |
| S | 34751 | 4.3% |
| I | 33536 | 4.1% |
| B | 31479 | 3.9% |
| Other values (18) | 319847 | 39.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 124183 | 57.9% |
| . | 60210 | 28.1% |
| , | 27002 | 12.6% |
| & | 2997 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1025313 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 124183 | 12.1% |
| A | 101402 | 9.9% |
| N | 78263 | 7.6% |
| . | 60210 | 5.9% |
| O | 52422 | 5.1% |
| E | 45396 | 4.4% |
| L | 42875 | 4.2% |
| e | 35916 | 3.5% |
| R | 35034 | 3.4% |
| S | 34751 | 3.4% |
| Other values (22) | 414861 | 40.5% |
servicer
Categorical
HIGH CORRELATION 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Other servicers | 14697 |
|---|---|
| WELLS FARGO BANK, N.A. | 13243 |
| U.S. BANK N.A. | 4689 |
| BANK OF AMERICA, N.A. | 4304 |
| JPMORGAN CHASE BANK, N.A. | 3627 |
| Other values (14) | 9440 |
Length
| Max length | 52 |
|---|---|
| Median length | 41 |
| Mean length | 20.57748 |
| Min length | 11 |
Characters and Unicode
| Total characters | 1028874 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | U.S. BANK N.A. |
|---|---|
| 2nd row | Other servicers |
| 3rd row | U.S. BANK N.A. |
| 4th row | Other servicers |
| 5th row | Other servicers |
Common Values
| Value | Count | Frequency (%) |
| Other servicers | 14697 | 29.4% |
| WELLS FARGO BANK, N.A. | 13243 | 26.5% |
| U.S. BANK N.A. | 4689 | 9.4% |
| BANK OF AMERICA, N.A. | 4304 | 8.6% |
| JPMORGAN CHASE BANK, N.A. | 3627 | 7.3% |
| PROVIDENT FUNDING ASSOCIATES, L.P. | 1837 | 3.7% |
| BRANCH BANKING & TRUST COMPANY | 1772 | 3.5% |
| CITIMORTGAGE, INC. | 1403 | 2.8% |
| FIFTH THIRD BANK | 986 | 2.0% |
| METLIFE HOME LOANS, A DIVISION OF METLIFE BANK, N.A. | 928 | 1.9% |
| Other values (9) | 2514 | 5.0% |
Words
| Value | Count | Frequency (%) |
| bank | 28961 | 17.3% |
| n.a | 26791 | 16.0% |
| other | 14697 | 8.8% |
| servicers | 14697 | 8.8% |
| wells | 13243 | 7.9% |
| fargo | 13243 | 7.9% |
| of | 5232 | 3.1% |
| u.s | 4689 | 2.8% |
| america | 4304 | 2.6% |
| jpmorgan | 4287 | 2.6% |
| Other values (35) | 37162 | 22.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 117306 | 11.4% |
| A | 103663 | 10.1% |
| N | 80797 | 7.9% |
| . | 68590 | 6.7% |
| O | 51829 | 5.0% |
| e | 44091 | 4.3% |
| r | 44091 | 4.3% |
| S | 34473 | 3.4% |
| E | 33452 | 3.3% |
| R | 32685 | 3.2% |
| Other values (25) | 417897 | 40.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 622377 | 60.5% |
| Lowercase Letter | 191061 | 18.6% |
| Space Separator | 117306 | 11.4% |
| Other Punctuation | 98130 | 9.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 103663 | 16.7% |
| N | 80797 | 13.0% |
| O | 51829 | 8.3% |
| S | 34473 | 5.5% |
| E | 33452 | 5.4% |
| R | 32685 | 5.3% |
| L | 32586 | 5.2% |
| B | 32505 | 5.2% |
| K | 30733 | 4.9% |
| G | 26913 | 4.3% |
| Other values (13) | 162741 | 26.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 44091 | 23.1% |
| r | 44091 | 23.1% |
| s | 29394 | 15.4% |
| c | 14697 | 7.7% |
| i | 14697 | 7.7% |
| v | 14697 | 7.7% |
| h | 14697 | 7.7% |
| t | 14697 | 7.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 68590 | 69.9% |
| , | 27768 | 28.3% |
| & | 1772 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 117306 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 813438 | 79.1% |
| Common | 215436 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 103663 | 12.7% |
| N | 80797 | 9.9% |
| O | 51829 | 6.4% |
| e | 44091 | 5.4% |
| r | 44091 | 5.4% |
| S | 34473 | 4.2% |
| E | 33452 | 4.1% |
| R | 32685 | 4.0% |
| L | 32586 | 4.0% |
| B | 32505 | 4.0% |
| Other values (21) | 323266 | 39.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 117306 | 54.5% |
| . | 68590 | 31.8% |
| , | 27768 | 12.9% |
| & | 1772 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1028874 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 117306 | 11.4% |
| A | 103663 | 10.1% |
| N | 80797 | 7.9% |
| . | 68590 | 6.7% |
| O | 51829 | 5.0% |
| e | 44091 | 4.3% |
| r | 44091 | 4.3% |
| S | 34473 | 3.4% |
| E | 33452 | 3.3% |
| R | 32685 | 3.2% |
| Other values (25) | 417897 | 40.6% |
super_conforming
Boolean
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 48944 |
| Missing (%) | 97.9% |
| Memory size | 97.8 KiB |
| True | 1056 |
|---|---|
| (Missing) | 48944 |
| Value | Count | Frequency (%) |
| True | 1056 | 2.1% |
| (Missing) | 48944 | 97.9% |
prr_loan_seq_num
Text
MISSING 
| Distinct | 5315 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 44685 |
| Missing (%) | 89.4% |
| Memory size | 390.8 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 63780 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 5315 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | F06Q10344882 |
|---|---|
| 2nd row | F04Q20024019 |
| 3rd row | F06Q30357730 |
| 4th row | F07Q40192786 |
| 5th row | F05Q30012809 |
Words
| Value | Count | Frequency (%) |
| f04q10332219 | 1 | < 0.1% |
| f05q20004455 | 1 | < 0.1% |
| f06q30357730 | 1 | < 0.1% |
| f07q40192786 | 1 | < 0.1% |
| f05q30012809 | 1 | < 0.1% |
| a06q30002868 | 1 | < 0.1% |
| f06q40000894 | 1 | < 0.1% |
| f06q20001301 | 1 | < 0.1% |
| f04q30007791 | 1 | < 0.1% |
| f06q40005198 | 1 | < 0.1% |
| Other values (5305) | 5305 | 99.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14727 | 23.1% |
| 3 | 5453 | 8.5% |
| 2 | 5419 | 8.5% |
| Q | 5315 | 8.3% |
| 1 | 5254 | 8.2% |
| F | 4970 | 7.8% |
| 4 | 4849 | 7.6% |
| 7 | 3883 | 6.1% |
| 6 | 3753 | 5.9% |
| 8 | 3707 | 5.8% |
| Other values (3) | 6450 | 10.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 53150 | 83.3% |
| Uppercase Letter | 10630 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 14727 | 27.7% |
| 3 | 5453 | 10.3% |
| 2 | 5419 | 10.2% |
| 1 | 5254 | 9.9% |
| 4 | 4849 | 9.1% |
| 7 | 3883 | 7.3% |
| 6 | 3753 | 7.1% |
| 8 | 3707 | 7.0% |
| 5 | 3372 | 6.3% |
| 9 | 2733 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Q | 5315 | 50.0% |
| F | 4970 | 46.8% |
| A | 345 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 53150 | 83.3% |
| Latin | 10630 | 16.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 14727 | 27.7% |
| 3 | 5453 | 10.3% |
| 2 | 5419 | 10.2% |
| 1 | 5254 | 9.9% |
| 4 | 4849 | 9.1% |
| 7 | 3883 | 7.3% |
| 6 | 3753 | 7.1% |
| 8 | 3707 | 7.0% |
| 5 | 3372 | 6.3% |
| 9 | 2733 | 5.1% |
Latin
| Value | Count | Frequency (%) |
| Q | 5315 | 50.0% |
| F | 4970 | 46.8% |
| A | 345 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63780 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 14727 | 23.1% |
| 3 | 5453 | 8.5% |
| 2 | 5419 | 8.5% |
| Q | 5315 | 8.3% |
| 1 | 5254 | 8.2% |
| F | 4970 | 7.8% |
| 4 | 4849 | 7.6% |
| 7 | 3883 | 6.1% |
| 6 | 3753 | 5.9% |
| 8 | 3707 | 5.8% |
| Other values (3) | 6450 | 10.1% |
program
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 9 | 50000 |
|---|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 9 |
| 3rd row | 9 |
| 4th row | 9 |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Words
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
relief_refi
Boolean
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 44685 |
| Missing (%) | 89.4% |
| Memory size | 97.8 KiB |
| True | 5315 |
|---|---|
| (Missing) | 44685 |
| Value | Count | Frequency (%) |
| True | 5315 | 10.6% |
| (Missing) | 44685 | 89.4% |
prop_val
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 9 | 50000 |
|---|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 9 |
| 3rd row | 9 |
| 4th row | 9 |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Words
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
interest_only
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.0 KiB |
| False | 50000 |
|---|---|
| Value | Count | Frequency (%) |
| False | 50000 | 100.0% |
MI_cancel
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 9 | 50000 |
|---|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50000 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 9 |
| 3rd row | 9 |
| 4th row | 9 |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Words
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 50000 | 100.0% |
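Several variables above are flagged CONSTANT (program, prop_val, interest_only, MI_cancel), while super_conforming and relief_refi are flagged both CONSTANT and MISSING because their only non-missing value is True. A minimal sketch of telling the two cases apart with pandas on a toy frame; the frame and list names are illustrative, and whether to drop either group is a modelling decision the report does not make.

```python
import pandas as pd

# Toy frame mirroring the report: program is constant, relief_refi is
# True-or-missing, and loan_purp genuinely varies.
df = pd.DataFrame({
    "program": ["9", "9", "9", "9"],
    "relief_refi": [True, None, None, True],
    "loan_purp": ["N", "C", "P", "N"],
})

# Columns with a single distinct value when missing counts as a value.
strictly_constant = [c for c in df.columns if df[c].nunique(dropna=False) == 1]

# Columns with a single distinct non-missing value, which matches how
# relief_refi and super_conforming are flagged above.
constant_ignoring_missing = [c for c in df.columns if df[c].nunique(dropna=True) == 1]

print(strictly_constant, constant_ignoring_missing)
```

A True-or-missing column still carries information (present versus absent), so it may be more useful recoded as a two-state flag than dropped.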
| | credit_score | first_pay | mat_date | msa | mortgage_ins_pct | orig_cltv | orig_dti | orig_upb | orig_ltv | orig_int | prop_zip | orig_loan_term | fha | unit_num | occupancy | channel | prop_type | loan_purp | borrower_num | seller | servicer |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| credit_score | 1.000 | -0.052 | -0.077 | 0.020 | -0.068 | -0.183 | -0.217 | -0.001 | -0.167 | -0.177 | 0.034 | -0.050 | 0.012 | 0.000 | 0.000 | 0.005 | 0.000 | 0.019 | 0.006 | 0.000 | 0.000 |
| first_pay | -0.052 | 1.000 | 0.573 | 0.012 | 0.006 | 0.104 | 0.172 | -0.050 | 0.099 | 0.135 | 0.001 | -0.057 | 0.012 | 0.000 | 0.000 | 0.026 | 0.000 | 0.023 | 0.005 | 0.068 | 0.066 |
| mat_date | -0.077 | 0.573 | 1.000 | 0.041 | 0.073 | 0.240 | 0.180 | 0.132 | 0.235 | 0.460 | 0.045 | 0.740 | 0.073 | 0.000 | 0.038 | 0.069 | 0.053 | 0.144 | 0.058 | 0.079 | 0.070 |
| msa | 0.020 | 0.012 | 0.041 | 1.000 | -0.027 | -0.050 | 0.025 | 0.056 | -0.044 | 0.012 | 0.188 | 0.043 | 0.025 | 0.043 | 0.048 | 0.065 | 0.116 | 0.055 | 0.003 | 0.084 | 0.079 |
| mortgage_ins_pct | -0.068 | 0.006 | 0.073 | -0.027 | 1.000 | 0.398 | 0.019 | -0.003 | 0.421 | 0.071 | -0.017 | 0.089 | 0.154 | 0.010 | 0.050 | 0.059 | 0.016 | 0.188 | 0.031 | 0.054 | 0.041 |
| orig_cltv | -0.183 | 0.104 | 0.240 | -0.050 | 0.398 | 1.000 | 0.224 | 0.136 | 0.941 | 0.224 | -0.016 | 0.214 | 0.128 | 0.013 | 0.069 | 0.078 | 0.025 | 0.235 | 0.053 | 0.046 | 0.035 |
| orig_dti | -0.217 | 0.172 | 0.180 | 0.025 | 0.019 | 0.224 | 1.000 | 0.083 | 0.198 | 0.179 | 0.030 | 0.094 | 0.091 | 0.000 | 0.021 | 0.256 | 0.031 | 0.335 | 0.014 | 0.229 | 0.196 |
| orig_upb | -0.001 | -0.050 | 0.132 | 0.056 | -0.003 | 0.136 | 0.083 | 1.000 | 0.102 | -0.071 | 0.051 | 0.220 | 0.019 | 0.056 | 0.103 | 0.107 | 0.072 | 0.076 | 0.104 | 0.072 | 0.077 |
| orig_ltv | -0.167 | 0.099 | 0.235 | -0.044 | 0.421 | 0.941 | 0.198 | 0.102 | 1.000 | 0.234 | -0.014 | 0.210 | 0.177 | 0.033 | 0.087 | 0.067 | 0.029 | 0.273 | 0.069 | 0.048 | 0.037 |
| orig_int | -0.177 | 0.135 | 0.460 | 0.012 | 0.071 | 0.224 | 0.179 | -0.071 | 0.234 | 1.000 | 0.014 | 0.443 | 0.062 | 0.058 | 0.231 | 0.027 | 0.033 | 0.126 | 0.073 | 0.072 | 0.065 |
| prop_zip | 0.034 | 0.001 | 0.045 | 0.188 | -0.017 | -0.016 | 0.030 | 0.051 | -0.014 | 0.014 | 1.000 | 0.063 | 0.048 | 0.052 | 0.087 | 0.146 | 0.166 | 0.088 | 0.044 | 0.197 | 0.175 |
| orig_loan_term | -0.050 | -0.057 | 0.740 | 0.043 | 0.089 | 0.214 | 0.094 | 0.220 | 0.210 | 0.443 | 0.063 | 1.000 | 0.072 | 0.000 | 0.038 | 0.068 | 0.051 | 0.141 | 0.058 | 0.077 | 0.069 |
| fha | 0.012 | 0.012 | 0.073 | 0.025 | 0.154 | 0.128 | 0.091 | 0.019 | 0.177 | 0.062 | 0.048 | 0.072 | 1.000 | 0.000 | 0.053 | 0.039 | 0.069 | 0.365 | 0.300 | 0.092 | 0.084 |
| unit_num | 0.000 | 0.000 | 0.000 | 0.043 | 0.010 | 0.013 | 0.000 | 0.056 | 0.033 | 0.058 | 0.052 | 0.000 | 0.000 | 1.000 | 0.189 | 0.008 | 0.034 | 0.024 | 0.013 | 0.023 | 0.023 |
| occupancy | 0.000 | 0.000 | 0.038 | 0.048 | 0.050 | 0.069 | 0.021 | 0.103 | 0.087 | 0.231 | 0.087 | 0.038 | 0.053 | 0.189 | 1.000 | 0.023 | 0.080 | 0.120 | 0.029 | 0.057 | 0.050 |
| channel | 0.005 | 0.026 | 0.069 | 0.065 | 0.059 | 0.078 | 0.256 | 0.107 | 0.067 | 0.027 | 0.146 | 0.068 | 0.039 | 0.008 | 0.023 | 1.000 | 0.066 | 0.107 | 0.025 | 0.519 | 0.401 |
| prop_type | 0.000 | 0.000 | 0.053 | 0.116 | 0.016 | 0.025 | 0.031 | 0.072 | 0.029 | 0.033 | 0.166 | 0.051 | 0.069 | 0.034 | 0.080 | 0.066 | 1.000 | 0.105 | 0.084 | 0.108 | 0.103 |
| loan_purp | 0.019 | 0.023 | 0.144 | 0.055 | 0.188 | 0.235 | 0.335 | 0.076 | 0.273 | 0.126 | 0.088 | 0.141 | 0.365 | 0.024 | 0.120 | 0.107 | 0.105 | 1.000 | 0.069 | 0.110 | 0.101 |
| borrower_num | 0.006 | 0.005 | 0.058 | 0.003 | 0.031 | 0.053 | 0.014 | 0.104 | 0.069 | 0.073 | 0.044 | 0.058 | 0.300 | 0.013 | 0.029 | 0.025 | 0.084 | 0.069 | 1.000 | 0.046 | 0.032 |
| seller | 0.000 | 0.068 | 0.079 | 0.084 | 0.054 | 0.046 | 0.229 | 0.072 | 0.048 | 0.072 | 0.197 | 0.077 | 0.092 | 0.023 | 0.057 | 0.519 | 0.108 | 0.110 | 0.046 | 1.000 | 0.779 |
| servicer | 0.000 | 0.066 | 0.070 | 0.079 | 0.041 | 0.035 | 0.196 | 0.077 | 0.037 | 0.065 | 0.175 | 0.069 | 0.084 | 0.023 | 0.050 | 0.401 | 0.103 | 0.101 | 0.032 | 0.779 | 1.000 |
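The correlation matrix above is easier to read as a short list of strongly related pairs. A minimal sketch that extracts such pairs from a sub-matrix copied from the table; the 0.5 cut-off is an assumption chosen for illustration, not a value stated in this report.

```python
import pandas as pd

cols = ["first_pay", "mat_date", "orig_cltv", "orig_ltv", "orig_loan_term"]

# Sub-matrix copied from the correlation table above.
corr = pd.DataFrame(
    [
        [1.000, 0.573, 0.104, 0.099, -0.057],
        [0.573, 1.000, 0.240, 0.235, 0.740],
        [0.104, 0.240, 1.000, 0.941, 0.214],
        [0.099, 0.235, 0.941, 1.000, 0.210],
        [-0.057, 0.740, 0.214, 0.210, 1.000],
    ],
    index=cols,
    columns=cols,
)

THRESHOLD = 0.5  # assumed cut-off, not stated in the report

# Walk the upper triangle only, so each pair is reported once.
pairs = [
    (a, b, float(corr.loc[a, b]))
    for i, a in enumerate(cols)
    for b in cols[i + 1:]
    if abs(corr.loc[a, b]) >= THRESHOLD
]

for a, b, value in pairs:
    print(f"{a} ~ {b}: {value:.3f}")
```

With this cut-off the sketch picks out first_pay with mat_date, mat_date with orig_loan_term, and orig_cltv with orig_ltv, which are the same pairs the report's alerts flag for these variables.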
First rows
| | credit_score | first_pay | fha | mat_date | msa | mortgage_ins_pct | unit_num | occupancy | orig_cltv | orig_dti | orig_upb | orig_ltv | orig_int | channel | ppm | amort | prop_state | prop_type | prop_zip | loan_id | loan_purp | orig_loan_term | borrower_num | seller | servicer | super_conforming | prr_loan_seq_num | program | relief_refi | prop_val | interest_only | MI_cancel |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 795 | 200903 | N | 202402 | NaN | 0 | 1 | P | 60 | 12 | 157000 | 60 | 4.500 | R | N | FRM | MN | SF | 56600 | F09Q10000013 | C | 180 | 2 | Other sellers | U.S. BANK N.A. | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 1 | 766 | 200903 | Y | 203902 | 24020.0 | 0 | 1 | P | 80 | 31 | 128000 | 80 | 5.750 | R | N | FRM | NY | SF | 12800 | F09Q10000078 | P | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 2 | 701 | 200903 | N | 203902 | 13380.0 | 0 | 1 | P | 74 | 61 | 162000 | 74 | 5.625 | R | N | FRM | WA | SF | 98200 | F09Q10000148 | C | 360 | 1 | Other sellers | U.S. BANK N.A. | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 3 | 791 | 200903 | N | 203902 | 36540.0 | 0 | 1 | P | 64 | 38 | 184000 | 64 | 5.625 | R | N | FRM | NE | PU | 68100 | F09Q10000154 | N | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 4 | 725 | 200903 | N | 203902 | 36540.0 | 0 | 1 | P | 85 | 29 | 130000 | 80 | 5.500 | R | N | FRM | NE | SF | 68100 | F09Q10000180 | N | 360 | 2 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 5 | 770 | 200904 | N | 203903 | 41500.0 | 0 | 1 | P | 75 | 25 | 195000 | 75 | 5.125 | R | N | FRM | CA | PU | 93900 | F09Q10000181 | C | 360 | 2 | Other sellers | U.S. BANK N.A. | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 6 | 805 | 200905 | N | 203904 | 29100.0 | 0 | 1 | P | 75 | 38 | 108000 | 75 | 5.125 | B | N | FRM | WI | SF | 54600 | F09Q10000183 | C | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 7 | 645 | 200904 | N | 203903 | NaN | 0 | 1 | P | 72 | 41 | 160000 | 72 | 5.625 | R | N | FRM | KY | SF | 40700 | F09Q10000216 | C | 360 | 2 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 8 | 763 | 200903 | N | 203902 | 23844.0 | 0 | 1 | P | 80 | 27 | 148000 | 80 | 5.500 | R | N | FRM | IN | SF | 46300 | F09Q10000255 | C | 360 | 2 | Other sellers | U.S. BANK N.A. | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 9 | 748 | 200903 | N | 203902 | NaN | 0 | 1 | S | 80 | 43 | 320000 | 80 | 5.625 | R | N | FRM | MI | SF | 49700 | F09Q10000449 | P | 360 | 2 | Other sellers | CENTRAL MORTGAGE COMPANY | NaN | NaN | 9 | NaN | 9 | N | 9 |
Last rows
| | credit_score | first_pay | fha | mat_date | msa | mortgage_ins_pct | unit_num | occupancy | orig_cltv | orig_dti | orig_upb | orig_ltv | orig_int | channel | ppm | amort | prop_state | prop_type | prop_zip | loan_id | loan_purp | orig_loan_term | borrower_num | seller | servicer | super_conforming | prr_loan_seq_num | program | relief_refi | prop_val | interest_only | MI_cancel |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 49990 | 741 | 201001 | N | 203912 | 11244.0 | 0 | 1 | P | 20 | 16 | 153000 | 20 | 5.125 | R | N | FRM | CA | SF | 90700 | F09Q40451505 | N | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49991 | 734 | 200912 | Y | 203911 | 17900.0 | 0 | 1 | P | 80 | 36 | 52000 | 80 | 5.125 | R | N | FRM | SC | SF | 29000 | F09Q40451550 | P | 360 | 1 | BRANCH BANKING & TRUST COMPANY | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49992 | 658 | 201002 | Y | 204001 | 17900.0 | 0 | 1 | P | 97 | 42 | 114000 | 97 | 5.250 | R | N | FRM | SC | SF | 29200 | F09Q40451574 | P | 360 | 1 | BRANCH BANKING & TRUST COMPANY | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49993 | 709 | 201001 | Y | 203912 | NaN | 0 | 1 | P | 100 | 36 | 76000 | 100 | 5.375 | R | N | FRM | VA | SF | 22600 | F09Q40451608 | P | 360 | 1 | BRANCH BANKING & TRUST COMPANY | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49994 | 784 | 201002 | Y | 204001 | 40220.0 | 0 | 1 | P | 100 | 25 | 78000 | 100 | 5.250 | R | N | FRM | VA | SF | 24100 | F09Q40451610 | P | 360 | 1 | BRANCH BANKING & TRUST COMPANY | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49995 | 749 | 201002 | Y | 204001 | 16740.0 | 0 | 1 | P | 100 | 38 | 121000 | 100 | 4.875 | R | N | FRM | NC | SF | 28000 | F09Q40451618 | P | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49996 | 749 | 201001 | Y | 203912 | 19060.0 | 0 | 1 | P | 100 | 43 | 90000 | 100 | 5.250 | R | N | FRM | WV | SF | 26700 | F09Q40451777 | P | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49997 | 775 | 201001 | Y | 203912 | 17900.0 | 0 | 1 | P | 100 | 34 | 126000 | 100 | 5.000 | R | N | FRM | SC | SF | 29200 | F09Q40451784 | P | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49998 | 709 | 200912 | Y | 203911 | 16740.0 | 0 | 1 | P | 100 | 40 | 104000 | 100 | 5.250 | R | N | FRM | NC | SF | 28000 | F09Q40451811 | P | 360 | 1 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |
| 49999 | 705 | 201002 | Y | 204001 | NaN | 0 | 1 | P | 93 | 34 | 65000 | 93 | 4.875 | R | N | FRM | PA | SF | 15900 | F09Q40451949 | P | 360 | 2 | Other sellers | Other servicers | NaN | NaN | 9 | NaN | 9 | N | 9 |